PyDigger - unearthing stuff about Python


Name Version Summary Date
exness-data-preprocess 0.7.2 Professional Exness forex tick data preprocessing with optimal compression (Parquet Zstd-22) and DuckDB OHLC generation. Provides efficient storage (9% smaller than ZIP) with lossless precision and direct queryability. 2025-10-29 21:17:18
fsspec-utils 0.2.7.1 Enhanced utilities and extensions for fsspec filesystems with multi-format I/O support 2025-10-28 14:45:38
rugo 0.1.20 Parquet Metadata Reader 2025-10-26 08:29:30
parquetframe 1.0.1 A universal data processing framework with multi-engine support (pandas, Polars, Dask) and multi-format I/O (CSV, JSON, Parquet, ORC, Avro) with intelligent backend selection 2025-10-19 18:21:30
forklift-etl 0.1.4 A powerful data processing and schema generation tool with PyArrow streaming, validation, and S3 support 2025-10-19 14:38:11
joinem 0.11.1 CLI for fast, flexible concatenation of tabular data using Polars. 2025-10-19 12:41:00
graphique 2.0 GraphQL service for python dataframes and parquet datasets. 2025-10-16 02:37:12
parq-cli 0.0.3 A powerful command-line tool for inspecting and analyzing Apache Parquet files 2025-10-14 12:51:25
tablediff-arrow 0.1.0 Fast, file-based diffs for Parquet/CSV/Arrow (local or S3) with keyed comparisons, per-column tolerances, and HTML/CSV reports—built on Apache Arrow. 2025-10-13 05:51:53
tablefaker 1.8.0 A Python package to generate fake tabular data. Get data in pandas dataframe or export to Parquet, DeltaLake, Csv, Json, Excel or Sql 2025-10-10 19:06:16
langchain-callback-parquet-logger 2.0.2 A Parquet-based callback handler for logging LangChain LLM interactions 2025-09-18 21:40:13
oss-metrics-kit 0.1.1 Unified toolkit to fetch, normalize, score, and export OSS contribution metrics. 2025-09-17 16:05:30
oups 2025.9.5 Out-of-core pipelines over ordered data: StatefulLoop, stateful ops, and ordered Parquet Store. 2025-09-10 07:25:47
shardate 2025.9.9.2 A lightweight Python library for efficiently reading year-month-day partitioned Parquet datasets. 2025-09-09 04:09:27
tacozip 0.6.0 TACO ZIP: ZIP64 archive with TACO Ghost supporting up to 7 metadata entries 2025-09-06 19:54:50
datatalk-cli 0.1.1 Query CSV and Parquet data with natural language 2025-09-01 23:38:55
pyquetmsMS 0.1.1 Memory-efficient mzML to Parquet converter for mass spectrometry files 2025-09-01 00:19:35
PyquetMS 0.1.0 Memory-efficient mzML to Parquet converter for mass spectrometry files 2025-08-31 23:51:16
parquetconv 0.2.1 A command-line tool for converting between Parquet and CSV file formats 2025-08-25 20:58:42
tidy-viewer-py 0.3.0 A cross-platform data pretty printer that uses column styling to maximize viewer enjoyment. Supports CSV, Parquet, Pandas, and Polars DataFrames with automatic data type detection and display. 2025-08-20 23:39:52